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10. Method for 
exchanging 
messages in an 
integrated circuit 
comprising a 
plurality of 
modules, 


Without conceding that the preamble of claim 10 of the ‘449 Patent is limiting, the Samsung 
Galaxy A53 (hereinafter, the “Samsung product”) performs a method for exchanging messages in 
an integrated circuit comprising a plurality of modules, either literally or under the doctrine of 
equivalents. 


The Samsung product includes an integrated circuit. For example, the Samsung product includes 
the Exynos 1280 system on chip (hereinafter, the “Exynos SoC”). 


Samsung Galaxy A53 


Exynos 1280 


https: / /semiconductor.samsung.com/ processor/showcase/smartphone 


1 The Samsung product is charted as a representative product made used, sold, offered for sale, and/or imported by Samsung. The citations to evidence contained herein are 
illustrative and should not be understood to be limiting. The right is expressly reserved to rely upon additional or different evidence, or to rely on additional citations to the evidence 


cited already cited herein. 
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The Exynos SoC comprises a plurality of modules, for example Arm Cortex-A78 core, Cortex-A55 
core, Arm Mali-G68 GPU, and AI Engine with NPU: 


Specifications 


Exynos 1280 


cPU Cortex®-A78 x 2 + Cortex®-A55 x 6 
GPU Mali™-G68 
Al Al Engine with NPU 


5G NR Sub-6GHz 2.55Gbps (DL) / 1.28Gbps (UL) 
Modem 5G NR mmWave 1.84Gbps (DL) / 0.92Gbps (UL) 
LTE Cat.18 6CC 1.2Gbps (DL) / Cat.18 2CC 200Mbps (UL) 


Connectivity WiFi 802.1ac MIMO with Dual-band (2.4/56G), 
Bluetooth” 5.2, FM Radio Rx 

GNSS Quad-constellation multi-signal for LI and LS GNSS 

Cannan Up to 108MP in single camera mode, 
Single-camera 32MP @30fps 

Video 4K 30fps encoding and decoding 

Display Full HD+@120Hz 

Memory LPDDR4x 

Storage UFS v2.2 

Process 5nm 


https:/ /semiconductor.samsung.com/resources/ brochure /Exynos1280.pdf 


The Exynos SoC included in the Samsung product utilizes Arteris network on chip interconnect 
technology, and/or a derivative thereof, (collectively, the “Arteris NoC”) to exchange messages: 
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samsung 


Samsung uses Arteris FlexNoC IP in its 
Samsung Exynos mobile phone 
applications processors, digital baseband 
modems, 4K SUHD TVs and Artik loT 
modules. 


LEARN MORE » 


https: / /web.archive.org/web/20210514110614/https: //www.arteris.com/customers 
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Arteris IP FlexNoC® Interconnect Licensed by 


Samsung's System LSI Business for Digital TV 
Chips 


CAMPBELL, Calif. -April 23, 2019- Arteris IP, the world’s leading supplier of innovative, silicon-proven network- 
on-chip (NoC) interconnect semiconductor intellectual property, today announced that Samsung's System LSI 
Business has renewed multiple Arteris IP FlexNoc Interconnect licenses for use in multiple high-performance 
digital TV (DTV) processing chips utilizing Samsung's latest semiconductor technology process nodes. 


6G Over many years, FlexNoC interconnect IP has helped us accelerate 
implementation of our digital TV chip designs on our latest semiconductor 
process nodes. This core interconnect technology is required to develop 
complex and highly optimized chips in a predictable, low-risk fashion.” 


SAMSUNG 


Joeyoul Lee, Vice President, Samsung Electronics 


Samsung first licensed FlexNoC interconnect IP in 2010. Since then, Samsung has used Arteris interconnect IP to 
enable complex SoC architectures in chips like the ERYHOSHMODNEIPEOlessOrs and other electronic systems. 


https: //www.arteris.com/ press-releases /samsung-lsi-dtv-arteris-ip-flexnoc 


4865-8602-9633.4 


Case 2:22-cv-00481-JRG Document 1-14 Filed 12/19/22 Page 6 of 54 PagelD #: 542 


U.S. Patent No. 7,373,449 (Radulescu and Goossens) 
“Apparatus and method for communicating in an integrated circuit” 


‘449 Patent Claim Samsung Product Including Exynos System on Chip! 


Arteris Interconnect IP Solution Selected by 
Samsung for Mobile SoC Deployment 


by Kurt Shuler, on November 02, 2010 


Network-on-Chip (NoC) interconnect technology leader enables higher performance and more cost 
effective designs for mobile phone systems-on-chip (SoCs) 


SUNNYVALE, California — November 2, 2010 — Arteris, Inc., a leading supplier of on-chip interconnect IP 
solutions, today announced that Samsung Electronics Co., Ltd., has selected Arteris’ interconnect solutions for 
multiple chips within Samsung’s mobile SOC products. Samsung chose Arteris interconnect IP to support the 
high speed inter-chip communication requirements in next generation mobile SOC products. 


@G The Arteris interconnect IP offers us a convenient solution to handle the high 
speed communication needed between our SoC and external modem IC. Our 
customers will benefit from the lower BOM cost and power consumption as a 
result of this IP. We look forward to Arteris’ interconnect IP helping us 
shorten development schedules and lower risks associated with 


compatibility. 


Thomas Kim, Vice President, SoC Platform Development, System LSI, Samsung Electronics 


https: / /www.arteris.com/ press-releases/pr_2010 nov_02?hsLang=en-us 
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The Arteris NoC exchanges messages in the Exynos SoC included in the Samsung product. 


For example, in the Arteris NoC, “[m]ost transactions require the following two-step transfers,” 
including “[a] master send[ing] request packets” and “the slave return[ing] response packets”: 


11.3.1.1 Transaction Layer 


The transaction layer is compatible with bus-based transaction protocols used 
for on-chip communications. It is implemented in NIUs, which are at the 
boundary of the NoC, and translates between third-party and NTTP proto- 
cols. Most transactions require the following two-step transfers: 


e A master sends request packets. 


e Then, the slave returns response packets. 


As shownin Figure 11.1, requests from an initiator are sent through the master 
NIU’s transmit port, Tx, to the NoC request network, where they are routed to 
the corresponding slave NIU. Slave NIUs, upon reception of request packets 
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on their receive ports, Rx, translate requests so that they comply with the pro- 
tocol used by the target third-party IP node. When the target node responds, 
returning responses are again converted by the slave NIU into appropriate 
response packets, then delivered through the slave NIU’s Tx port to the 
response network. The network then routes the response packets to the re- 
questing master NIU, which forwards them to the initiator. At the transaction 
level, NIUs enable multiple protocols to coexist within the same NoC. From 
the point of view of the NTTP modules, different third-party protocols are 
just packets moving back and forth across the network. 
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FIGURE 11.1 
NTTP protocol layers mapped on NoC units and Media Independent NoC Interface—MINI. 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 312-313; see id at 308 
(explaining that Chapter 11 of this book describes the function of the Arteris NoC: “In this chapter 
we will present an MPSoC platform [...] using Arteris NoC as communication infrastructure.”). 


the messages 
between the 
modules being 


Without conceding that the preamble of claim 10 of the “449 Patent is limiting, the Arteris NoC 
exchanges messages between modules in the Exynos SoC included in the Samsung product over 
connections via a network, wherein said connections comprises a set of communication channels 


4865-8602-9633.4 


9 


Case 2:22-cv-00481-JRG Document 1-14 Filed 12/19/22 Page 10 of 54 PagelD #: 546 


U.S. Patent No. 7,373,449 (Radulescu and Goossens) 
“Apparatus and method for communicating in an integrated circuit” 


‘449 Patent Claim Samsung Product Including Exynos System on Chip! 


exchanged over 
connections via a 
network, wherein 
said connections 
comprises a set of 
communication 
channels each 
having a set of 
connection 
properties, any 
communication 
channel being 
independently 
configurable, 


each having a set of connection properties any communication channel being independently 
configurable, either literally or under the doctrine of equivalents. 


A large SoC, such as the Exynos SoC included in the Samsung product may include multiple 
classes of Arteris NoC interconnect network: 


Logical Interconnect Topology Development 
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e ArChipi6 Example: Large SoCs have multiple classes of interconnect 
— Non-coherent, Coherent, Control/Status, Observability, etc. 
e Necore & FlexNoC interconnects are managed separately from IP blocks, increasing design flexibility 


ISPD 2018, 28 March 2018 Copyright © 2018 ArterisIP | 9 


ARTERIS@ 


See Physical Interconnect Aware Network Optimizer, http://www.ispd.cc/slides/2018/s7_2.pdf, 
at slide 9. 
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The Exynos SoC included in the Samsung product utilizes the Arteris NoC to exchange messages 
over connections via a network, wherein said connections comprises a set of communication 
channels that are independently configurable. 


For example, in the the Arteris NoC, “[a]n NTTP transaction is typically made of request packets, 
traveling through the request network between the master and the slave NIUs, and response 
packets that are exchanged between a slave NIU and a master NIU through the response 
network.... Transactions are handed off to the transport layer, which is responsible for delivering 
packets between endpoints of the NoC (using links, routers, muxes, rated adapters, FIFOs, etc.). 
Between NoC components, packets are physically transported as cells across various interfaces, a 
cell being a basic data unit being transported. This is illustrated in Figure 11.1, with one master 
and one slave node, and one router in the request and response path.” 


11 
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FIGURE 11.1 
NTTP protocol layers mapped on NoC units and Media Independent NoC Interface—MINI. 


See Networks-On-Chips Theory and Practice, https: 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 312-313. 


Connections within the Arteris NoC network may be defined by a connectivity table: 
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» Routes must pass through available channels in the floorplan 
* Connectivity passes from initiator NIU to switch, to link, to RC buffers and finally to target NIU 


Copyright © 2018 Arteris IP | 12 


AR TERISM@@ ISPD 2018, 28 March 2018 
See Physical Interconnect Aware Network Optimizer, http://www.ispd.cc/slides/2018/s7_2.pdf 
at slide 12. 


In the Arteris NoC, “[t]he delivery of packets within the NoC is the responsibility of the physical 
layer [where the] link size, or width (i.e., number of wires), is set by the designer at design 
time[and] [o]ne link (represented in Figure 11.1) defines the following signals... Pres. — Indicates 
the current priority of the packet used to define preferred traffic class (or Quality of Service). The 
width is fixed during the design time, allowing multiple pressure levels within the same NoC 
instance (bits 3-5 in Figure 11.2) [and] RxRdy — flow control”: 


13 
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11.3.1.3 Physical Layer 


The delivery of packets within the NoC is the responsibility of the physical 
layer. Packets, which have been split by the transport layer into cells, are 
delivered as words that are sent along links. Within a single clock cycle, the 
physical layer may carry words comprising a fraction of a cell, a single cell, 
or multiple cells. The link size, or width (i.e., number of wires), is set by the 
designer at design time and determines the number of cells of one word. 
NTTP defines five possible link-widths: quarter (QRT), half (HLF), single 
(SGL), double (DBL), and quad (QUAD). A single-width (SGL) link transmits 
one cell per clock cycle, a double-width link transmits two cells per clock cycle, 
and so on. Words travel within point-to-point links, which are independent 
from other protocol layers: a word is sent through a transmit port, Tx, over a 
link to a receive port, Rx. The actual number of wires in a link depends on the 
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maximum cell-width (header, necker, and data cell) and the link-width. One 
link (represented in Figure 11.1) defines the following signals: 


Data—Data word of the width specified at design-time. 


Frm—When asserted high, indicates that a packet is being transmit- 
ted. 


Head—When asserted high, indicates the current word contains a 
packet header. When the link-width is smaller than single (SGL), the 
header transmission is split into several word transfers. However, 
the Head signal is asserted during the first transfer only. 


TailOfs—Packet tail: when asserted high, indicates that the current 
word contains the last packet cell. When the link-width is smaller 
than single (SGL), the last cell transmission is split into several word 
transfers. However, the Tail signal is asserted during the first transfer 
only. 

Pres.—Indicates the current priority of the packet used to define 
preferred traffic class (or Quality of Service). The width is fixed 
during the design time, allowing multiple pressure levels within 
the same NoC instance (bits 3-5 in Figure 11.2). 


V1ld—Data valid: when asserted high, indicates that a word is being 
transmitted. 

RxRdy—Flow control: when asserted high, the receiver is ready to 
accept word. When de-asserted, the receiver is busy. 


This signal set, which constitutes the Media Independent NoC Interface 
(MINI), is the foundation for NTTP communications. 
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See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 313-314. 


The Exynos SoC included in the Samsung product utilizes the Arteris NoC’s connections that 
comprise a set of communication channels each having a set of connection properties, any 
communication channel being independently configurable. 


For example, as noted above, in the Arteris NoC, “[o]ne link (represented in Figure 11.1) defines 
the following signals... Pres. — Indicates the current priority of the packet used to define preferred 
traffic class (or Quality of Service) [and] RxRdy —flow control.” 


In the Arteris NoC implements Quality of Service (QoS) to “provides a regulation mechanism 
allowing specification of guarantees on some of the parameters related to the traffic”: 


Quality of Service (QoS). The QoS is a very important feature in the inter- 
connect infrastructures because it provides a regulation mechanism allowing 
specification of guarantees on some of the parameters related to the traf- 
fic. Usually the end users are looking for guarantees on bandwidth and/or 
end-to-end communication latency. Different mechanisms and strategies have 
been proposed in the literature. For instance, in Aithereal NoC [11,24] pro- 
posed by NXP, a TDMA approach allows the specification of two traffic cat- 
egories [25]: BE and GT. 

In the Arteris NoC, the QoS is achieved by exploiting the signal pressure em- 
bedded into the NTTP packet definition (Figures 11.1 and 11.2). The pressure 
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signal can be generated by the IP itself and is typically linked to a certain level 
of urgency with which the transaction will have to be completed. For exam- 
ple, we can imagine associating the generation of the pressure signal when a 
certain threshold has been reached in the FIFO of the corresponding IP. This 
pressure information will be embedded in the NTTP packet at the NIU level: 
packets that have pressure bits equal to zero will be considered without QoS; 
packets with a nonzero value of the pressure bit will indicate preferred traffic 
class.* Such a QoS mechanism offers immediate service to the most urgent 
inputs and variables, and fair service whenever there are multiple contend- 
ing inputs of equal urgency (BE). Within switches, arbitration decisions favor 
preferred packets and allocate remaining bandwidth (after preferred packets 
are served) fairly to contending packets. When there are contending preferred 
packets at the same pressure level, arbitration decisions among them are also 
fair. 
The Arteris NoC supports the following four different traffic classes: 
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e Real time and low latency (RTLL)—TIraffic flows that require the 
lowest possible latency. Sometimes it is acceptable to have brief 
intervals of longer latency as long as the average latency is low. 
Care must be taken to avoid starving other traffic flows as a side 
effect of pursuing low latency. 


¢ Guaranteed throughput (GT)—TIraffic flows that must maintain 
their throughput over a relatively long time interval. The actual 
bandwidth needed can be highly variable even over long intervals. 
Dynamic pressure is employed for this traffic class. 


e Guaranteed bandwidth (GBW)—Iraffic flows that require a guar- 
anteed amount of bandwidth over a relatively long time interval. 
Over short periods, the network may lag or lead in providing this 
bandwidth. Bandwidth meters may be inserted onto links in the 
NoC to regulate these flows, using either of the two methods. If the 
flow is assigned high pressure, the meter asserts backpressure (flow 
control) to prevent the flow from exceeding a maximum bandwidth. 
Alternatively, the meter can modulate the flows pressure (priority) 
dynamically as needed to maintain an average bandwidth. 


e Best effort (BE)—TIraffic flows that do not require guaranteed 
latency or throughput but have an expectation of fairness. 
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* Note that in the NTTP packet, the pressure field allows more then one bit, resulting in multiple 
levels of preferred traffic. 


Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 315-316. 


As a further illustration, the Arteris NoC “addresses ... varied QoS needs in many ways,” 
including “Dynamic Packet Priorities” and “Dynamic Pressure Propagation”: 


Arbitration: Dynamic Packet Priorities & Dynamic 
Pressure Propagation 


Arteris Network on Chip technology addresses these varied QoS needs in many 
ways: First, the interconnect assigns priorities to transactions to ensure they arrive 
at the target in the proper order to meet system requirements. Priority levels can be 
attached to individual packets or to all transactions pending on a socket. The 
interconnect can also assign Dynamic Packet Priorities at runtime. 


Second, the interconnect can sense when high priority packets may be blocked or 
slowed due to downstream traffic congestion and can then clear a path for these 
high priority packets. This technology, called Dynamic Pressure Propagation, is 
analogous to a fire truck racing down city streets: All traffic pulls to the side of the 
road to let the fire truck through. 


https: //www.arteris.com/end-to-end-quality-of-service-qos 
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As a further illustration, “QoS information may be generated from within the [Arteris] NoC 
interconnect using Arteris’ QoS Generator”: 


Bandwidth Limiters and Rate Regulators 


Many times architects will want to implement QoS within their SoC but the QoS 
prioritization data is not available from the individual IP blocks. In this case, QoS 
information may be generated from within the NoC interconnect using Arteris’ QoS 
Generator. The QoS Generator can instantiate sophisticated, and software 
programmable, means to regulate interconnect QoS, including: 


> Bandwidth Limiters - Bandwidth limiters cause a socket to stop accepting 
requests when a run-time programmable throughput threshold has been 
exceeded. 

> Rate Regulators - Rate regulators cause a socket's transactions to be demoted 
when a bandwidth threshold is reached. This can be considered a smoother 
version of the bandwidth limiter because transactions are only demoted 


instead of stalled. 


https: //www.arteris.com/end-to-end-quality-of-service-qos 


As a further illustration, the Arteris NoC uses “a mechanism called rated adaptation, which stalls 
packets just enough to remove wait states from the packets, preserving a low latency.” For other 
traffic, the “[b]est effort traffic can be left untouched|[,]” “[l]atency sensitive traffic may have its 
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urgency modulated as a function of the transaction[,]” “[s]oft real-time traffic may have its hurry 
level modulated as a function of the bandwidth it receives|,|” and “[o]n the real-time modem data 
port, the hurry is fixed at a critical level”: 


Those effects can be mended by the insertion of buffering. In the case of peak bandwidth 
reduction, a simple FIFO does the job: Busy states present at the output of the FIFO do 
not propagate back to the input until the FIFO is full. For a peak bandwidth increase, the 
situation is a bit more complex. In a FIFO, wait states present at the input are only absorbed 
when the FIFO is not empty. Arteris proposes a mechanism called rate adaptation, which 
stalls packets just enough to remove wait states from the packets, preserving a low latency. 

In this second step, the architecture is modified to introduce some buffering. In our ex- 
ample 760 bytes of memory have been distributed across the topology. Some have been put 
on existing links; some required the creation of new links. 


See Application driven network-on-chip architecture exploration & refinement for a complex SoC, 


https:/ /www.arteris.com/hs-fs/hub/48858 / file-14363521- 
pdf/docs/springerappdrivennocarchitecture8.5x11.pdf, at pg.16. 


For the other traffic, “the configuration can be done in architecture”: 
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e Best effort traffic can be left untouched. 

e Latency sensitive traffic may have its urgency modulated as a function of the transaction: 
Normal for writes and important for reads. 

e Soft real-time traffic may have its hurry level modulated as a function of the bandwidth 
it receives: Critical until a specified bandwidth is obtained on a sliding 4 microsecond 
window, and normal thereafter. These settings are set through configuration registers and 
may be modified while the interconnect is running. The mechanism is called a bandwidth 
regulator. 

e On the real-time modem data port, the hurry is fixed at a critical level. 


Id. at 18. 


As a further illustration, connections within the Arteris NoC may be classified by traffic class and 
traffic classes may be mapped onto the Arteris interconnect topology: 
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Memory NoC: 
Interconnect Topology — Traffic Classes 


Classify your IP connections per class of 
traffic: 


Best Effort (BE) Image system | 
Low Latency (LL) SRAM 
High Bandwidth (HB) Main/Coherency 


Modifications 


Defer | cir 


|cswo—~—“‘SSC*diS COB. BE. BEE BE 

|psvw/o=F——“(‘i‘“‘;Ssé*S|:SCSCCBE BE BE BE 

[FromcohNocwen#70 | LL HB HB HB HB 7 

| FromMainNoC//o | LL HB HB HB HB ¥ 

fis CC. BE BE BE BE 
ARTERIS@ ISPD 2018, 28 March 2018 Gopyright © 2018 Arteris IP | 13 


23 


4865-8602-9633.4 


Case 2:22-cv-00481-JRG Document 1-14 Filed 12/19/22 Page 24 of 54 PagelD #: 560 


U.S. Patent No. 7,373,449 (Radulescu and Goossens) 
“Apparatus and method for communicating in an integrated circuit” 


449 Patent Claim Samsung Product Including Exynos System on Chip! 


Memory NoC: 
Traffic classes are mapped onto logical interconnect topology 


ARTERISiMa ISPD 2018, 28 March 2018 Copyright © 2018 Arteris IP | 16 
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Memory Access Traffic Classes 


— Cache Coherent (CC) 
— Low Latency (LL) 


g 
®) Ly CSRNoC 


+ s 3 . . 
222 8 — High Bandwidth (HB) 
A A Al |3) > 
_— GPU DsP = (eel E — Best Effort (BE) 
CPU CPU 
big ite MMU MMU a cE PeripherallPs w Configuration 
g 
P 


PO) NN 


ops |P 


ARTERIS@ 


at slides 11, 


See Physical Interconnect Aware Network Optimizer, http: 


ISPD 2018, 28 March 2018 


13, 16. 


¢ Cache Coherent (CC) 


within Compute Cluster 


° Low Latency (LL) to 


« 


SRAM 


High Bandwidth (HB) 
to DRAM & Cache Fill 


Best Effort (BE) for 
Peripherals & DMA 


e QoS for Video 


e Multiple functional 


NoCs interacting 


e Physically Constrained 


Copyright © 2018 Arteris IP | 11 


www.ispd.cc/slides/2018/s7_2.pd 


As a further illustration, in the Arteris NoC, “QoS is supported in the switch using pressure 
information generated by the IP itself and embedded in NTTP packets” and “[s]ome features [of 
the switch] can be software-controlled at runtime through the service network”: 
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11.3.3.1 Switching 


The switching is done by accepting NTTP packets carried by input ports and 
forwarding each packet transparently to a specific output port. The switch is 
characterized with a fully synchronous operation and can be implemented 
as a full crossbar (up to one data word transfer per port and per cycle), 
although there is an automatic removal of hardware corresponding to unused 
input/output port connections (port depletion). The switch uses wormhole 
routing, for reduced latency, and can provide full throughput arbitration; that 
is, up to one routing decision per input port and per cycle. An arbitrary num- 
ber of switches can be connected in cascade, supporting any loopless network 
topology. The QoS is supported in the switch using the pressure information 
generated by the IP itself and embedded in NTTP packets. 

A switch can be configured to meet specific application requirements by 
setting the MINI-ports (Rx or Tx ports, as defined by the MINI interface 
introduced earlier) attributes, routing tables, arbitration mode, and pipelining 
strategy. Some of the features can be software-controlled at runtime through 
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the service network. There is one routing table per Rx port and one arbiter 
per Tx port. Packet switching consists of the following four stages: 


1. Choosing the route—Using relevant information extracted from 
the packet, the routing table selects a target output port. 


2. Arbitrating—Because more than a single input port can request a 
given output port at a given time, an arbiter selects one request- 
ing input port per output port. The arbiter maintains input/output 
connection until the packet completes its transit in the switch. 


3. Switching—Once routing and arbitration decisions have been made, 
the switch transports each word of the packet from its input port to 
its output port. The switch implementation employs a full crossbar, 
ensuring that the switch does not contribute to congestion. 


4. Arbiter release—Once the last word of a packet has been pipelined 


into the crossbar, the arbiter releases the output, making it available 
for other packets that may be waiting at other input ports. 


The simplified block diagram of the switch architecture is shown in 
Figure 11.6. 
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Router Architecture 


Input Crossbar Output 
Controller i i Controller 


Control 
Connection 
Control Control 


Target Output Output Crossbar 
Address Number Request (Flow control) 


Route 
Table 


Configuration Supervision Registers | Buftering 
Pipeline stage 
it Service Bus [ r y 
FIGURE 11.6 


Packet transportation unit: Router architecture. 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 319-320. 
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As a further illustration, the “Pres.” signal in the NITP packet “[i]ndicates the current priority of 
the packet used to define preferred traffic class (or Quality of Service). The width is fixed during 
the design time, allowing multiple pressure levels within the same NoC instance (bits 3-5 in 


Figure 11.2).” 


35 29 28 25 24 15 14 543 0 

Header Master Address Slave Address Pr] Opcode _] 
Necker [___Tag emf ——S——~CSSrveofiset——SSSCSC~*~SCS Stat] StOPO 
Data [BE[ DataByte [BE] DataByte [BE] DataByte [BE] DataByte 
Data [BEL Data Byte BE. Data Byte ]BE[ DataByte [BE] DataBye 
32 3130 27 26 20 19 14 13 ie as | 0 

Header Rev [| Len Master Address [Pr] Opcode 


Data CEP Data 
Data 
FIGURE 11.2 

NTTP packet structure. 

See id. at 313, 314. 

As a further illustration, in the Arteris NoC, “the routing tables actually used in the switch are 


for each switch input. Routing tables can optionally be programmed via the service network 
interface; in this case, their configuration registers appear in the switch register address map.” 


See id. at 322. 
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wherein said Without conceding that the preamble of claim 10 of the ‘449 Patent is limiting, the Arteris NoC in 
connection the Exynos SoC included in the Samsung product has connections through the network that 
through the support transactions comprising at least one of outgoing messages from the first module to the 


network supports | second module and return messages from the second module to the first module, either literally 
transactions or under the doctrine of equivalents. 

comprising at least 
one of outgoing For example, in the Arteris NoC used by the Exynos SoC included in the Samsung product, 
messages from the | “[mlJost transactions require the following two-step transfers,” including “[a] master send[ing] 
first module to the | request packets” and “the slave return[ing] response packets”: 

second module 


viene 11.3.1.1 Transaction Layer 

messages from the 

second module to The transaction layer is compatible with bus-based transaction protocols used 
the first module for on-chip communications. It is implemented in NIUs, which are at the 


boundary of the NoC, and translates between third-party and NTTP proto- 
cols. Most transactions require the following two-step transfers: 


e A master sends request packets. 


e Then, the slave returns response packets. 


As shown in Figure 11.1, requests from an initiator are sent through the master 
NIU’s transmit port, Tx, to the NoC request network, where they are routed to 
the corresponding slave NIU. Slave NIUs, upon reception of request packets 
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on their receive ports, Rx, translate requests so that they comply with the pro- 
tocol used by the target third-party IP node. When the target node responds, 
returning responses are again converted by the slave NIU into appropriate 
response packets, then delivered through the slave NIU’s Tx port to the 
response network. The network then routes the response packets to the re- 
questing master NIU, which forwards them to the initiator. At the transaction 
level, NIUs enable multiple protocols to coexist within the same NoC. From 
the point of view of the NTTP modules, different third-party protocols are 
just packets moving back and forth across the network. 
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FIGURE 11.1 
NTTP protocol layers mapped on NoC units and Media Independent NoC Interface—MINI. 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 312-313. 


and further 
comprising the 
steps of: 


the first module 
issuing a request 


In the Arteris NoC in the Exynos SoC included in the Samsung product, the first module issues a 
request for a connection with the second module to a communication manager, wherein the 
request comprises desired connection properties associated with the sets of communication 
channels, either literally or under the doctrine of equivalents. 
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for a connection 
with the second The first module of the Exynos SoC included in the Samsung product utilizes the Arteris NoC to 


module to a issue a request for a connection with the second module to a communication manager. 
communication 

manager, wherein | For example, in the Arteris NoC, “[mlJost transactions require the following two-step transfers,” 
the request including “[a] master send[ing] request packets” and “the slave return[ing] response packets”: 
comprises desired 

Sean 11.3.1.1 Transaction Layer 

properties 

associated with The transaction layer is compatible with bus-based transaction protocols used 
the sets of for on-chip communications. It is implemented in NIUs, which are at the 
a boundary of the NoC, and translates between third-party and NTTP proto- 
channels; 


cols. Most transactions require the following two-step transfers: 


e A master sends request packets. 


e Then, the slave returns response packets. 


As shown in Figure 11.1, requests from an initiator are sent through the master 
NIU’s transmit port, Tx, to the NoC request network, where they are routed to 
the corresponding slave NIU. Slave NIUs, upon reception of request packets 
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on their receive ports, Rx, translate requests so that they comply with the pro- 
tocol used by the target third-party IP node. When the target node responds, 
returning responses are again converted by the slave NIU into appropriate 
response packets, then delivered through the slave NIU’s Tx port to the 
response network. The network then routes the response packets to the re- 
questing master NIU, which forwards them to the initiator. At the transaction 
level, NIUs enable multiple protocols to coexist within the same NoC. From 
the point of view of the NTTP modules, different third-party protocols are 
just packets moving back and forth across the network. 
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FIGURE 11.1 
NTTP protocol layers mapped on NoC units and Media Independent NoC Interface—MINI. 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 312-313. 


The request issued by first module of the Exynos SoC included in the Samsung product comprises 
desired connection properties associated with the sets of communication channels. 


For example, in the Arteris NoC, “[t]he delivery of packets within the NoC is the responsibility of 
the physical layer [where the] link size, or width (i.e., number of wires), is set by the designer at 
design time[and] [o]ne link (represented in Figure 11.1) defines the following signals... Pres. — 
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Indicates the current priority of the packet used to define preferred traffic class (or Quality of 
Service). The width is fixed during the design time, allowing multiple pressure levels within the 
same NoC instance (bits 3-5 in Figure 11.2) [and] RxRdy — flow control”: 


11.3.1.3 Physical Layer 


The delivery of packets within the NoC is the responsibility of the physical 
layer. Packets, which have been split by the transport layer into cells, are 
delivered as words that are sent along links. Within a single clock cycle, the 
physical layer may carry words comprising a fraction of a cell, a single cell, 
or multiple cells. The link size, or width (i.e., number of wires), is set by the 
designer at design time and determines the number of cells of one word. 
NTTP defines five possible link-widths: quarter (QRT), half (HLF), single 
(SGL), double (DBL), and quad (QUAD). A single-width (SGL) link transmits 
one cell per clock cycle, a double-width link transmits two cells per clock cycle, 
and so on. Words travel within point-to-point links, which are independent 
from other protocol layers: a word is sent through a transmit port, Tx, over a 
link to a receive port, Rx. The actual number of wires in a link depends on the 
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maximum cell-width (header, necker, and data cell) and the link-width. One 
link (represented in Figure 11.1) defines the following signals: 


Data—Data word of the width specified at design-time. 


Frm—When asserted high, indicates that a packet is being transmit- 
ted. 


Head—When asserted high, indicates the current word contains a 
packet header. When the link-width is smaller than single (SGL), the 
header transmission is split into several word transfers. However, 
the Head signal is asserted during the first transfer only. 


TailOfs—Packet tail: when asserted high, indicates that the current 
word contains the last packet cell. When the link-width is smaller 
than single (SGL), the last cell transmission is split into several word 
transfers. However, the Tail signal is asserted during the first transfer 
only. 

Pres.—Indicates the current priority of the packet used to define 
preferred traffic class (or Quality of Service). The width is fixed 
during the design time, allowing multiple pressure levels within 
the same NoC instance (bits 3-5 in Figure 11.2). 


V1ld—Data valid: when asserted high, indicates that a word is being 
transmitted. 

RxRdy—Flow control: when asserted high, the receiver is ready to 
accept word. When de-asserted, the receiver is busy. 


This signal set, which constitutes the Media Independent NoC Interface 
(MINI), is the foundation for NTTP communications. 
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See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 313-314. 


As a further example, in the Arteris NoC, “QoS is achieved by exploiting the signal pressure 
embedded into the NTTP packet definition” where the “pressure signal can be generated by the IP 
itself and is typically linked to a certain level of urgency with which the transaction will have to 
be completed” and the “pressure information will be embedded in the NTTP packet at the NIU 
level”: 


Quality of Service (QoS). The QoS is a very important feature in the inter- 
connect infrastructures because it provides a regulation mechanism allowing 
specification of guarantees on some of the parameters related to the traf- 
fic. Usually the end users are looking for guarantees on bandwidth and/or 
end-to-end communication latency. Different mechanisms and strategies have 
been proposed in the literature. For instance, in Aithereal NoC [11,24] pro- 
posed by NXP, a TDMA approach allows the specification of two traffic cat- 
egories [25]: BE and GT. 

In the Arteris NoC, the QoS is achieved by exploiting the signal pressure em- 
bedded into the NTTP packet definition (Figures 11.1 and 11.2). The pressure 
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signal can be generated by the IP itself and is typically linked to a certain level 
of urgency with which the transaction will have to be completed. For exam- 
ple, we can imagine associating the generation of the pressure signal when a 
certain threshold has been reached in the FIFO of the corresponding IP. This 
pressure information will be embedded in the NTTP packet at the NIU level: 
packets that have pressure bits equal to zero will be considered without QoS; 
packets with a nonzero value of the pressure bit will indicate preferred traffic 
class.* Such a QoS mechanism offers immediate service to the most urgent 
inputs and variables, and fair service whenever there are multiple contend- 
ing inputs of equal urgency (BE). Within switches, arbitration decisions favor 
preferred packets and allocate remaining bandwidth (after preferred packets 
are served) fairly to contending packets. When there are contending preferred 
packets at the same pressure level, arbitration decisions among them are also 
fair. 
The Arteris NoC supports the following four different traffic classes: 
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e Real time and low latency (RTLL)—TIraffic flows that require the 
lowest possible latency. Sometimes it is acceptable to have brief 
intervals of longer latency as long as the average latency is low. 
Care must be taken to avoid starving other traffic flows as a side 
effect of pursuing low latency. 


¢ Guaranteed throughput (GT)—TIraffic flows that must maintain 
their throughput over a relatively long time interval. The actual 
bandwidth needed can be highly variable even over long intervals. 
Dynamic pressure is employed for this traffic class. 


e Guaranteed bandwidth (GBW)—Iraffic flows that require a guar- 
anteed amount of bandwidth over a relatively long time interval. 
Over short periods, the network may lag or lead in providing this 
bandwidth. Bandwidth meters may be inserted onto links in the 
NoC to regulate these flows, using either of the two methods. If the 
flow is assigned high pressure, the meter asserts backpressure (flow 
control) to prevent the flow from exceeding a maximum bandwidth. 
Alternatively, the meter can modulate the flows pressure (priority) 
dynamically as needed to maintain an average bandwidth. 


e Best effort (BE)—TIraffic flows that do not require guaranteed 
latency or throughput but have an expectation of fairness. 
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* Note that in the NTTP packet, the pressure field allows more then one bit, resulting in multiple 
levels of preferred traffic. 


35 29 28 25 24 15 14 543 0 
Header Master Address Slave Address Prs|___Opcode__] 
Necker 
Data (BET __DataByte_ [BE] DataByte [BE] DataByte [BE] DataByte —_—| 
Data 
32 3130 27 26 20 19 14 13 543 0 

Header Master Address [Ps] Opcode 
Data a 
Data lat OOOOOCSCSCS—S 


FIGURE 11.2 
NTTP packet structure. 


Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 313, 315-316. 


As a further example, in the Arteris NoC, “QoS is supported in the switch using pressure 
information generated by the IP itself and embedded in NTTP packets” and the router 
architecture includes blocks such as “Input Controller,” “Flow Control,” “Crossbar (Flow 
control)” “Route Table” and “Arbiter”: 
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11.3.3.1 Switching 


The switching is done by accepting NTTP packets carried by input ports and 
forwarding each packet transparently to a specific output port. The switch is 
characterized with a fully synchronous operation and can be implemented 
as a full crossbar (up to one data word transfer per port and per cycle), 
although there is an automatic removal of hardware corresponding to unused 
input/output port connections (port depletion). The switch uses wormhole 
routing, for reduced latency, and can provide full throughput arbitration; that 
is, up to one routing decision per input port and per cycle. An arbitrary num- 
ber of switches can be connected in cascade, supporting any loopless network 
topology. The QoS is supported in the switch using the pressure information 
generated by the IP itself and embedded in NTTP packets. 

A switch can be configured to meet specific application requirements by 
setting the MINI-ports (Rx or Tx ports, as defined by the MINI interface 
introduced earlier) attributes, routing tables, arbitration mode, and pipelining 
strategy. Some of the features can be software-controlled at runtime through 
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the service network. There is one routing table per Rx port and one arbiter 
per Tx port. Packet switching consists of the following four stages: 


1. Choosing the route—Using relevant information extracted from 
the packet, the routing table selects a target output port. 


2. Arbitrating—Because more than a single input port can request a 
given output port at a given time, an arbiter selects one request- 
ing input port per output port. The arbiter maintains input/output 
connection until the packet completes its transit in the switch. 


3. Switching—Once routing and arbitration decisions have been made, 
the switch transports each word of the packet from its input port to 
its output port. The switch implementation employs a full crossbar, 
ensuring that the switch does not contribute to congestion. 


4. Arbiter release—Once the last word of a packet has been pipelined 


into the crossbar, the arbiter releases the output, making it available 
for other packets that may be waiting at other input ports. 


The simplified block diagram of the switch architecture is shown in 
Figure 11.6. 


43 


4865-8602-9633.4 


Case 2:22-cv-00481-JRG Document 1-14 Filed 12/19/22 Page 44 of 54 PagelD #: 580 


U.S. Patent No. 7,373,449 (Radulescu and Goossens) 
“Apparatus and method for communicating in an integrated circuit” 


‘449 Patent Claim Samsung Product Including Exynos System on Chip! 


Router Architecture 


Input Crossbar Output 
Controller i i Controller 


Control 
Connection 
Control Control 


Target Output Output Crossbar 
Address Number Request (Flow control) 


Route 
Table 


Configuration Supervision Registers | Buftering 
Pipeline stage 
it Service Bus [ r y 
FIGURE 11.6 


Packet transportation unit: Router architecture. 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 319-320. 
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the In the Arteris NoC in the Exynos SoC included in the Samsung product, the communication 
communication manager forwards the request to a resource manager and the resource manager determines 
manager whether a target connection with the desired connection properties is available, either literally or 
forwarding the under the doctrine of equivalents. 


request to a 
ae manager; | For example, in the Arteris NoC used by Exynos SoC included in the Samsung product, “QoS is 
the resource supported in the switch” which “choos[es] the route” using a “routing table”; “arbitrat[es]”; and 
manager “switch[es]” and the router architecture includes blocks such as “Input Controller,” “Flow 
determining Control,” “Crossbar (Flow control)” “Route Table” and “Arbiter”: 

whether a target 
connection with 
the desired 
connection 
properties is 
available; 
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11.3.3.1 Switching 


The switching is done by accepting NTTP packets carried by input ports and 
forwarding each packet transparently to a specific output port. The switch is 
characterized with a fully synchronous operation and can be implemented 
as a full crossbar (up to one data word transfer per port and per cycle), 
although there is an automatic removal of hardware corresponding to unused 
input/output port connections (port depletion). The switch uses wormhole 
routing, for reduced latency, and can provide full throughput arbitration; that 
is, up to one routing decision per input port and per cycle. An arbitrary num- 
ber of switches can be connected in cascade, supporting any loopless network 
topology. The QoS is supported in the switch using the pressure information 
generated by the IP itself and embedded in NTTP packets. 

A switch can be configured to meet specific application requirements by 
setting the MINI-ports (Rx or Tx ports, as defined by the MINI interface 
introduced earlier) attributes, routing tables, arbitration mode, and pipelining 
strategy. Some of the features can be software-controlled at runtime through 
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the service network. There is one routing table per Rx port and one arbiter 
per Tx port. Packet switching consists of the following four stages: 


1. Choosing the route—Using relevant information extracted from 
the packet, the routing table selects a target output port. 


2. Arbitrating—Because more than a single input port can request a 
given output port at a given time, an arbiter selects one request- 
ing input port per output port. The arbiter maintains input/output 
connection until the packet completes its transit in the switch. 


3. Switching—Once routing and arbitration decisions have been made, 
the switch transports each word of the packet from its input port to 
its output port. The switch implementation employs a full crossbar, 
ensuring that the switch does not contribute to congestion. 


4. Arbiter release—Once the last word of a packet has been pipelined 


into the crossbar, the arbiter releases the output, making it available 
for other packets that may be waiting at other input ports. 


The simplified block diagram of the switch architecture is shown in 
Figure 11.6. 
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Packet transportation unit: Router architecture. 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 319-320. 
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As a further illustration, in the Arteris NoC, “the pressure information used to define the 
preferred traffic class (QoS) of the requesting inputs... [t]he pressure information is given top 
priority by the switch arbiter” and “the input controller extracts pertinent data from packet 
headers, forwards it to the routing table, fetches back the target output number, and then sends a 
request to the arbiter. After arbitration is granted, the input controller transmits the rest of the 
packet to the crossbar. The request to the arbiter is sustained as long as the last word of the packet 
has not been transferred. Upon transferring the last cell of the packet, the arbiter is allowed to 
select a new input.” 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26qivv11f0, at 321, 322. 


the resource In the Arteris NoC in the Exynos SoC included in the Samsung product, the resource manager 
manager responds with the availability of the target connection to the communication manager, either 
responding with literally or under the doctrine of equivalents. 

the availability of 

the target For example, in the Arteris NoC used by the Exynos SoC included in the Samsung product, “the 
connection to the | pressure information used to define the preferred traffic class (QoS) of the requesting inputs... 
communication [t]he pressure information is given top priority by the switch arbiter” and “the input controller 
manager; and extracts pertinent data from packet headers, forwards it to the routing table, fetches back the 


target output number, and then sends a request to the arbiter. After arbitration is granted, the 
input controller transmits the rest of the packet to the crossbar. The request to the arbiter is 
sustained as long as the last word of the packet has not been transferred. Upon transferring the 
last cell of the packet, the arbiter is allowed to select a new input.” 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26qivv11f0, at 321, 322. 
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As a further illustration, in the Arteris NoC, “[t]he arbiter ensures that the connection matrix (a 
row per input and a column per output) contains at most one connection per column, that is, a 
given output is not fed by two inputs at the same time. The dual guarantee— at most one 
connection per row —is handled by the input controller. Each output has an arbiter that includes 
prefiltering. For maximum flexibility, each port can specify its own arbiter from the list of 
available arbiters (random, round robin, LRU, FIFO, or fixed priority).” 


Id. at 322-323. 


the target 
connection 
between the first 
and second 
module being 
established based 
on the available 
properties of said 
communication 
channels of said 
connection. 


In the Arteris NoC in the Exynos SoC included in the Samsung product, the target connection 
between the first and second module is established based on the available properties of said 
communication channels of said connection, either literally or under the doctrine of equivalents. 


For example, in the Arteris NoC used by the Exynos SoC included in the Samsung product, “QoS 
is supported in the switch” which “choos[es] the route” using a “routing table”; “arbitrat[es]”; and 


“switch[es]”: 
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11.3.3.1 Switching 


The switching is done by accepting NTTP packets carried by input ports and 
forwarding each packet transparently to a specific output port. The switch is 
characterized with a fully synchronous operation and can be implemented 
as a full crossbar (up to one data word transfer per port and per cycle), 
although there is an automatic removal of hardware corresponding to unused 
input/output port connections (port depletion). The switch uses wormhole 
routing, for reduced latency, and can provide full throughput arbitration; that 
is, up to one routing decision per input port and per cycle. An arbitrary num- 
ber of switches can be connected in cascade, supporting any loopless network 
topology. The QoS is supported in the switch using the pressure information 
generated by the IP itself and embedded in NTTP packets. 

A switch can be configured to meet specific application requirements by 
setting the MINI-ports (Rx or Tx ports, as defined by the MINI interface 
introduced earlier) attributes, routing tables, arbitration mode, and pipelining 
strategy. Some of the features can be software-controlled at runtime through 
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the service network. There is one routing table per Rx port and one arbiter 
per Tx port. Packet switching consists of the following four stages: 


1. Choosing the route—Using relevant information extracted from 
the packet, the routing table selects a target output port. 


2. Arbitrating—Because more than a single input port can request a 
given output port at a given time, an arbiter selects one request- 
ing input port per output port. The arbiter maintains input/output 
connection until the packet completes its transit in the switch. 


3. Switching—Once routing and arbitration decisions have been made, 
the switch transports each word of the packet from its input port to 
its output port. The switch implementation employs a full crossbar, 
ensuring that the switch does not contribute to congestion. 


4. Arbiter release—Once the last word of a packet has been pipelined 


into the crossbar, the arbiter releases the output, making it available 
for other packets that may be waiting at other input ports. 


The simplified block diagram of the switch architecture is shown in 
Figure 11.6. 
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Packet transportation unit: Router architecture. 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26givv11f0, at 319-320. 
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In the Arteris NoC, “the pressure information used to define the preferred traffic class (QoS) of 
the requesting inputs... [t]he pressure information is given top priority by the switch arbiter” and 
“the input controller extracts pertinent data from packet headers, forwards it to the routing table, 
fetches back the target output number, and then sends a request to the arbiter. After arbitration is 
granted, the input controller transmits the rest of the packet to the crossbar. The request to the 
arbiter is sustained as long as the last word of the packet has not been transferred. Upon 
transferring the last cell of the packet, the arbiter is allowed to select a new input.” 


See Networks-On-Chips Theory and Practice, https:/ /vdoc.pub/download/networks-on-chips- 
theory-and-practice-embedded-multi-core-systems-6f26qivv11f0, at 321, 322. 


As a further illustration, in the Arteris NoC, “[t]he arbiter ensures that the connection matrix (a 
row per input and a column per output) contains at most one connection per column, that is, a 
given output is not fed by two inputs at the same time. The dual guarantee— at most one 
connection per row —is handled by the input controller. Each output has an arbiter that includes 
prefiltering. For maximum flexibility, each port can specify its own arbiter from the list of 
available arbiters (random, round robin, LRU, FIFO, or fixed priority).” 


Id. at 322-323. 


As a further illustration, in the Arteris NoC, “[t]he crossbar implements datapath connection 
between inputs and outputs. It uses the connection matrix produced by the arbiter to determine 
which connections must be established. It is equivalent to a set of m muxes (one per output port), 
each having n inputs (one per input port).” 


Id. at 323. 
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